Mining for Relevant Terms from Log Files

نویسندگان

  • Hassan Saneifar
  • Stéphane Bonniol
  • Anne Laurent
  • Pascal Poncelet
  • Mathieu Roche
چکیده

The Information extracted from log files of computing systems can be considered one of the important resources of information systems. In the case of Integrated Circuit design, log files generated by design tools are not exhaustively exploited. The logs of this domain are multi-source, multi-format, and have a heterogeneous and evolving structure. Moreover, they usually do not respect the grammar and the structures of natural language though they are written in English. According to features of such textual data, applying the classical methods of information extraction is not an easy task, more particularly for terminology extraction. We have previously introduced EXTERLOG approach to extract the terminology from such log files. In this paper, we introduce a new developed version of EXTERLOG guided by Web. We score the extracted terms by a Web and context based measure. We favor the more relevant terms of domain and emphasize the precision by filtering terms based on their scores. The experiments show that EXTERLOG is well-adapted terminology extraction approach from log files.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on Various Web Mining Functionalities using Web Log Files

As the size of web increases along with number of users, it is very much essential for the website owners to better understand their customers so that they can provide better service, and also enhance the quality of the website. To achieve this they depend on the web access log files. The web access log files can be mined to extract interesting pattern so that the user behavior can be understoo...

متن کامل

Frequent Pattern Mining of Web Log Files Working Principles

Frequent pattern mining plays a major role in mining of web log files. Web usage mining is the one of the web mining process that involves application of mining techniques to web server logs to extract the behavior of users. A web usage mining consists of three important phases: data preprocessing, patterns discovery and pattern analysis. In data preprocessing phase the unwanted data are remove...

متن کامل

Learning about Online Learning Processes and Students' Motivation through Web Usage Mining

This study illustrates the potential of applying Web usage mining the analysis of Web log files in educational research. It consists of two sub-studies and focuses on two types of analysis, both related to the whole learning process: investigating one learner's activity in order to learn about her or his learning process, and examining the activity of a large group of learners, in order to deve...

متن کامل

Exploitation of Server Log Files of User Behavior in Order to Inform Administrator

Requests that users are sent to the Web server all in server log files are stored in a file called. It can be stated that users will see all pages or all user requests to the server, the file is stored. In this paper we investigate the behavior of users through web usage mining techniques and using server log files to help service administrators paid websites. At the end a web site downloading ...

متن کامل

LOGDIG log file analyzer for mining expected behavior from log files

Log files are often the only way to identify and locate errors in a deployed system. This paper presents a new log file analyzing framework, LOGDIG, for checking expected system behavior from log files. LOGDIG is a generic framework, but it is motivated by logs that include temporal data (timestamps) and system-specific data (e.g. spatial data with coordinates of moving objects), which are pres...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009